On the Construction of Initial Basis Function for Efficient Value Function Approximation

نویسندگان

  • Chung-Cheng Chiu
  • Kuan-Ta Chen
چکیده

We address the issues of improving the feature generation methods for the value-function approximation and the state space approximation. We focus the improvement of feature generation methods on approaches based on the Bellman error. The original Bellman-error-based approaches construct the first basis function as an arbitrary nonzero vector. This kind of design results an inefficient generation of the basis functions. We propose a method to construct the first basis function that models the structure of the value-function. Our method improves the efficiency of existing feature generation algorithms and derives a more precise model for value-function approximation. We also propose to use the relevance vector machine to find a sparse state representation and project the original high-dimensional state space to the resulting low-dimensional state space. Our framework shows improved performance on existing benchmark problems, and is also effective on a car racing problem.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Approximation of a Fuzzy Function by Using Radial Basis Functions Interpolation

In the present paper, Radial Basis Function interpolations are applied to approximate a fuzzy function $tilde{f}:Rrightarrow mathcal{F}(R)$, on a discrete point set $X={x_1,x_2,ldots,x_n}$, by a fuzzy-valued function $tilde{S}$. RBFs are based on linear combinations of terms which include a single univariate function. Applying RBF to approximate a fuzzy function, a linear system wil...

متن کامل

Optimal Pareto Parametric Analysis of Two Dimensional Steady-State Heat Conduction Problems by MLPG Method

Numerical solutions obtained by the Meshless Local Petrov-Galerkin (MLPG) method are presented for two dimensional steady-state heat conduction problems. The MLPG method is a truly meshless approach, and neither the nodal connectivity nor the background mesh is required for solving the initial-boundary-value problem. The penalty method is adopted to efficiently enforce the essential boundary co...

متن کامل

A numerical solution of a Kawahara equation by using Multiquadric radial basis function

In this article, we apply the Multiquadric radial basis function (RBF) interpo-lation method for nding the numerical approximation of traveling wave solu-tions of the Kawahara equation. The scheme is based on the Crank-Nicolsonformulation for space derivative. The performance of the method is shown innumerical examples.

متن کامل

TBM Tunneling Construction Time with Respect to Learning Phase Period and Normal Phase Period

In every tunnel boring machine (TBM) tunneling project, there is an initial low production phase so-called the Learning Phase Period (LPP), in which low utilization is experienced and the operational parameters are adjusted to match the working conditions. LPP can be crucial in scheduling and evaluating the final project time and cost, especially for short tunnels for which it may constitute a ...

متن کامل

Minimizing a General Penalty Function on a Single Machine via Developing Approximation Algorithms and FPTASs

This paper addresses the Tardy/Lost penalty minimization on a single machine. According to this penalty criterion, if the tardiness of a job exceeds a predefined value, the job will be lost and penalized by a fixed value. Besides its application in real world problems, Tardy/Lost measure is a general form for popular objective functions like weighted tardiness, late work and tardiness with reje...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009